moving windows

Terms from Artificial Intelligence: humans at the heart of algorithms

Page numbers are for draft copy at present; they will be replaced with correct numbers when final book is formatted. Chapter numbers are correct and will not change now.

When applying windowing techniques in {time series}} or sequential data moving windows takes windows of a fixed length starting at every position, for example [1,2,3,4], [2.3.4.5], [3.4.5.6], etc. This is in contrast to non-overlapping windows. Moving windows are often used when wanting to make a prediction of the item following the window, for example, predictig data poiht 5 based on points 1 to 4. In time-series analysis moving averages do this with simple linaer predictors, often over a small number of past observations. Large-language modles use the same technique but with a deep nearal network to perform the prediction and window size that is mnay thousands, if not millions of tokens long.